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Abstract 

In  this  paper  we  examine  some  existing  procedures  for  testing  linear 
and  log-linear  regression  models,  especially  with  respect  to  the  trunca- 
tion of  the  disturbance  term  in  the  linear  model.   A  discrimination 
criterion  based  on  two  Lagrange  multiplier  test  statistics  is  suggested. 
Some  new  computationally  straightforward  exact  tests  which  treat  the 
linear  and  log-linear  regressions  explicitly  as  non-nested  models  are 
derived.   These  tests  are  also  generalized  to  take  account  of  truncation 
in  the  errors  of  the  linear  model. 
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1 .   Introduction 

Economic  theory  provides  information  regarding  the  range  of  vari- 
ables that  enter  into  functional  relationships,  but  it  is  especially 
weak  with  regard  to  the  specific  nature  of  those  relations.   It  is 
therefore  heartening  to  see  the  recent  attention  that  is  being  paid  to 
the  testing  of  various  model  specifications  that  may  arise  in  practice. 
Consider,  for  example,  the  class  of  data  transformations  involved  in 
testing  linear  and  log-linear  regression  models.   Two  different  direc- 
tions have  been  taken  with  regard  to  testing  such  models  against  each 
other.   Aneuryn-Evans  and  Deaton  (1980)  treat  linear  and  log-linear 
specifications  as  inherently  non-nested  and  base  their  testing  proce- 
dures on  the  modified  likelihood  ratio  (LR)  principle  of  Cox  (1961, 
1962).   This  approach  is  an  adaptation  of  the  classical  LR  test,  re- 
quiring estimation  under  both  the  null  and  alternative  models,  and 
leads  to  test  statistics  which  are  very  complicated  to  compute.   A 
different  approach  has  been  taken  by  Godfrey  and  Wickens  (1981),  who 
base  their  tests  on  the  data  transformation  suggested  by  Box  and  Cox 
(1964).   Although  both  linear  and  log-linear  variants  may  be  tested  by 
the  LR  method  against  the  artificial  compound  model  in  which  they  are 
both  nested,  Godfrey  and  Wickens  consider  two  approaches  which  effec- 
tively require  estimation  only  under  the  null  hypothesis.   The  first 
method  is  based  on  the  Lagrange  multiplier  (LM)  principle  and  leads  to 
test  statistics  which  are  asymptotically  chi-squared  with  one  degree  of 
freedom  under  the  respective  null  hypotheses.   A  second  approach  appeals 
to  the  work  of  Andrews  (1971),  whose  Taylor-expansion  of  the  trans- 
formed variables  leads  to  an  exact  test  of  the  (linear)  null  model. 


-2- 

This  paper  has  several  aims.   In  Section  2,  we  consider  the  effects 
of  truncation  on  the  test  procedures  suggested  by  Godfrey  and  Wickens. 
It  turns  out  that  the  LM  statistic  for  testing  the  log-linear  model 
is  unaffected  by  truncation,  whereas  the  LM  test  of  the  linear  model 
is  affected,  in  different  ways,  by  both  symmetric  and  unsymmetric  trunca- 
tion.  It  is  shown  in  Section  3  that  when  neither  the  linear  nor  log- 
linear  model  is  rejected,  a  choice  between  them  may  be  based  simply  upon 
the  difference  between  the  two  calculated  LM  statistics.   In  Section  4 
we  provide  some  new  procedures  for  testing  models  in  which  the  dependent 
variable  is  subjected  to  different  (known)  data  transformations.   Since 
these  tests  are  based  on  artificial  regressions  which  use  some  prior 
information  on  the  specific  form  of  the  alternative  model,  their  power 
properties  would  be  expected  to  differ  from  those  of  the  LM  tests  al- 
ready mentioned.   When  the  null  model  is  log-linear,  or  is  linear  with 
negigible  truncation,  these  new  tests  have  the  property  of  a  known  null 
distribution.   In  addition,  the  tests  are  conceptually  straightforward 
and  are  easy  to  calculate.   In  Section  5,  the  tests  developed  are  modi- 
fied to  incorporate  truncation  in  the  disturbances  of  the  linear  model. 

2.   Effects  of  Truncation  on  Some  Existing  Test  Procedures 
Consider  the  following  two  non-nested  specifications 


V  yt  =  ft(V6)  +  eoc  (1) 

V  ?t   -«t<«t!T)  +*u  (2) 

in  which  y*  is  a  one-to-one  known  transformation  of  y  ,  f   and  g  are 

known  (possibly)  non-linear  functions  with  arguments  (x  ;3)  and  (z  ;y)» 
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respectively ,  in  which  the  vectors  of  explanatory  variables  x  and  z 

t      t 

are  exogenous  and  the  vectors  of  unknown  coefficients  6  and  y   are  iden- 
tifiable under  H  and  H  ,  respectively.   Following  Aneuryn-Evans  and 
o      i 

Deaton  (1980,  p.  276),  we  assume  that 

2 
£   ~  NID(0,a  ). 
ot  o 

Different  transformations  on  y  to  obtain  y*  impose  some  restrictions 

on  the  possible  range  of  values  of  y  .   For  example,  if  y*  =  log  y  , 

2 
y  must  be  restricted  to  be  positive  so  that  e,   cannot  be  NID(0,a,  )• 
't  It  1 

Aneuryn-Evans  and  Deaton  (1980,  p.  276-277)  consider  symmetric  trunca- 
tion for  e,   when  y*  =  log  y  ,  in  which  case  the  density  function  for 

It  t  t'  J 

e        can   be   written  as 


2 
*(elt)    =  cx(m)p(elt;a1),         |elt|    <  mc^ 


(4) 


=   0  »         |eltl    >   ma! 

2 
where  p(e  ;o  )  is  the  probability  density  function  (p.d.f.)  of  an 

2 
N(0,a  )  variate  and  a(m)  is  given  by 

a(m)  =  {/   (2TT)"1/2e"1/2?  dg]      .  (5) 


-m 


Fo 


r  "large"  values  of  m,  a(m)  in  (5)  will  be  close  to  unity  and  '"'(e-,t:) 

2 
in  (4)  will  be  approximately  N(0,a,). 

The  approach  adopted  by  Godfrey  and  Wickens  (1981)  is  to  nest  two 

special  forms  of  (1)  and  (2),  namely 


H  :   log  y  =  So  ♦  II  iog  x.t  +  «  (6) 

1=1 


-<*- 


within  the  Box-Cox  framework 

k 

yt(X)  =  6Q  +  Z  eixlt(X)  +  et  (8) 

i=l 

in  which 

v  (X)  =  (v*-l)/A,    X  =f  0 


=  log  v   ,   X  =  0 

and  S.  =  Y.(i  >  0)  in  (8)  when  X  =  1.   For  notational  convenience  only, 
no  distinction  is  made  between  the  coefficients  in  (6)  and  in  (8).   It 
will  be  assumed  that  X    [0,®)  for  simplicity,  implying  that  the  para- 
meter value  lies  at  the  boundary  of  the  parameter  space  under  H  :   X  =  0. 

o 

This  does  not,  however,  affect  the  validity  of  the  LM  tests  of  H   [see 

o 

Moran  (1971)  and  Chant  (1974)].   If  the  effects  of  truncating  e   are 

2 
small,  the  disturbance  in  (8)  will  be  approximately  NID(0,a  )•   The  LM 

tests  of  Godfrey  and  Wickens  are  obtained  by  regressing  the  unit  element 

on  the  partial  derivatives  of  the  log-likelihood  function  of  (8)  with 

2 
respect  to  (3, a  ,X)  evaluated  at  the  restricted  estimates  under  the 

respective  null  models  H  :   X  =  0  and  H, :   X  =  1.   The  LM  statistics 

o  1 

are  simply  the  sample  size  times  the  respective  coefficients  of  multiple 
determination  from  the  artificial  regression  with  the  unit  element  as 
the  "dependent  variable"  and  the  partial  derivatives  as  the  "independent 
variables . " 

It  is  claimed  by  Godfrey  and  Wickens  (1981,  p.  491)  that  the  LM 
statistic  is  unaffected  by  a  particular  form  of  symmetric  truncation 
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of  e   in  (7).   Their  observation  was  based  on  the  fact  that  using  the 

truncated  p.d.f.  ir(e,  )  in  (4)  simply  adds  the  known  constant  log  a(m) 

to  the  log-likelihood  function  for  one  observation.   The  log-likelihood 

function  with  truncated  p.d.f.  is 

T 
£(S,a2,X)  =  I  &t(S,a  ,X) 

t=l 

where 

I   (S,a2,X)    =  -1/2  log   2tt   -    1/2   log   a2  +   (X-l)log   yt 

1  k  2 

-  -~  {Vt^)    -  B0  -  2  0ixic(X)}   +  log  a(m,X).   (9) 

2a  i=l 

If  we  consider  a  more  general  form  of  symmetric  truncation  within  the 
context  of  the  Box-Cox  transformation,  a(m)  in  (4)  and  (5)  would  be 
modified  to  incorporate  the  parameter  X.   The  appropriate  modification 
of  a(m)  is  given  by 

ct(m,X)  =  (/    ^(C)dc}"1  (10) 

-m/X 

where 

-V2  -V2E2 
<j>(5)    =   <2ir)         e     ILt>    . 

Notice  that  (10)  contains  distributions  corresponding  to  H  :   X  =  0  and 

H  :   X  =  1  as  special  cases,  and  that  a(m,X)  will  be  close  to  unity, 

+ 
irrespective  of  the  finite  non-zero  value  for  m,  as  X  *  0  .   In  parti- 
cular, note  that  a(m,X)  is  not  close  to  unity  when  X  =  1  and  m  is  not 
"large,"  since  a(m,l)  in  (10)  is  identical  to  a(m)  in  (5).   However, 
the  use  of  (5)  does  not  include  the  distributions  of  both  the  linear  and 
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log-linear  models  as  special  cases,  and  hence  does  not  exploit  fully 
the  presence  of  the  parameter  X  in  the  Box-Cox  framework.   Within  this 
more  general  context,  it  is  useful  to  examine  whether  the  LM  test  of 
the  linear  model  is  affected  by  symmetric  truncation. 
The  effects  of  truncation  can  be  evaluated  from 


3  log  a 
3X 


=  -  a 


8X 


f-m/X 
-m/X 


=  -  a 


SX 


m/X  -m/X 


=  -  a  •  [-(m/X2)4>(m/X)  "  (m/X2)(j>(-m/X)  J 


2  am 


<Km/X) 


in  which  a  =  ct(m,X).   It  is  easily  seen  that 


and 


lim 
X+l 


9  log  a    2m  (fr(m) 
3X     '  $(m)-$(-m) 


.    9  log  a        n 
llm+   3X    =  ° 


(11) 


m 
in  which  $(m)  =  /   <J>(£)d£.   This  shows  that  the  LM  test  statistic  for 

— oo 

the  log-linear  model  (6)  will  not  be  affected  by  symmetric  truncation. 
However,  for  the  linear  model  the  "regressor"  3£  /3X  is  not  the  same  as 
that  given  by  Godfrey  and  Wickens  because  we  must  add  the  constant  term 
2m  ij)(m)/($(m)-<!>(-m))  to  it.   Since  the  artificial  regression  equations 


used  to  calculate  the  LM  test  statistics  do  not  include  an  intercept, 

the  coefficient  of  multiple  determination  with  the  modification  given 

above  will  oe  different.   Therefore  the  LM  statistic  for  testing  the 

linear  model  will  be  affected  by  symmetric  truncation. 

An  alternative  form  of  truncation  mentioned,  but  not  analyzed,  by 

Aneuryn-Evans  and  Deaton  (1980)  is  that "of  Amemiya  (1973).   In  this 

case,  a(m,X)  in  (10)  is  rewritten  as 

-1 
a(3,a,X)  =  {/       $(C)dS} 

-x'3/Xa 

t 

k 

in  which  x'3  =3   +  Z   3.x.  .   It  is  straightforward  to  show  that 
t     o    .=1  i  it 


3  ^g  a  =  a  •  (x'SA2a)  •  4>(-x'3/Xa) 

d  A  t 


3  l0g  a  =  a  •  (-x  Aa)  •  4>(-x'8/Xa) 


36  t       T    C 

3  l°$   a  =  a  •  (x'3/Xa2)  ■  4>(-x'3/Xa). 
<3a  t  t 

The  above  derivatives  tend  to  zero  as  X  +  0  ,  whence  the  LM  statistics 
for  the  log-linear  model  is  unaffected.   When  X  =  1,  however,  the  par- 
tial derivatives  are  all  non-zero  quantities  so  that  the  regressors 
required  to  calculate  the  LM  statistic  for  the  linear  model,  namely 
dl   /3X,  di   /33  and  9£  /3a,  will  be  different  for  the  truncated  model. 
Therefore  the  LM  statistics  for  the  linear  and  truncated  linear  models 
will  be  different  from  each  other.   However,  when  x'6  is  large  compared 
to  a,  all  these  derivatives  will  be  negligible  and  hence  the  effects  of 
truncation  will  be  small. 
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3.   A  Discrimination  Criterion  Based  on  LM  Statistics 

An  especially  simple  criterion  for  discriminating  between  H  and  H 

is  the  Sargan  (1964)  criterion  which  is  based  on  the  difference  between 

maximized  log-likelihoods  when  truncation  of  the  linear  model  is  negli- 

gible.   If  £  ~i      is  the  log  of  the  likelihood  ratio,  H  is  chosen  if 

the  difference  is  positive  and  H  if  it  is  negative.   An  alternative 

o 

approach  may  be  based  upon  the  calculated  values  of  the  LM  statistic 

under  both  H  and  H, .   Denote  by  LM  and  LM.  the  LM  statistics  for  H 
o      1  o       1  o 

2  k+3 

and  H  ,  respectively.   Let  9  =  (B,a  ,X)f    0        .   The  asymptotic 

equivalence  between  the  LM  and  LR  statistics  under  the  tested  hypotheses 

and  for  local  alternatives  permits  us  to  write 


LM   =  LR   =  2[sup  £(9)  -   sup   £(9)] 
9         0|X=0 


and 


LM.  =  LR.  =  2[sup  £(9)  -   sup   £(9)] 
1     1  0         ©|X=1 


a 


where  =  denotes  asymptotic  equivalence.   Thus,  in  large  samples,  the 
difference  between  the  LM  statistics  is 


LM  -  LM  i  2[  sup   £(9)  -  sup   £(9)J.  (12) 

1  °     0|X=O        0|X=1 

The  model  selection  criterion  implicit  in  (12)  is  that  when  LM1  and  LM 


are  both  insignificant,  tL  (H  )  should  be  chosen  if  LM  -LM  is  negati 
°  1   o  i   o 


ve 


(positive).   Of  course,  a  similar  rule  can  be  applied  even  if  both  LM 

and  LM  are  significant,  although  the  selection  rule  will  not  be  con- 
o 

sistent  with  the  outcome  of  the  statistical  test.   Our  suggestion  of 
choosing  the  model  with  the  smaller  calculated  LM  statistic  is  obviously 
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very  similar  to  the  Sargan  criterion  of  selecting  the  model  with  the 

larger  maximized  livelihood.   Since  both  LM  and  L>L  are  calculated  in 
a —  0       x 

testing  H  and  H, ,  respectively,  the  discrimination  criterion  based  on 
o      1 

the  LM  statistics  is  very  straightforward  to  apply. 

4 .   Some  New  Exact  Non-nested  Tests 

The  LM  statistics  discussed  above  are  essentially  testing  the  spe- 
cial cases  X    =   0  and  X    =   1  against  the  Box-Cox  transformation  given  in 

(8).   Suppose  instead  that  a  test  of  the  (null)  hypothesis  H  :   X  =  0 

o 

is  required  to  have  high  power  against  the  specific  non-nested  alterna- 
tive H  :   A  =  1,  and  vice-versa.   In  this  section  we  develop  some  new 
and  computationally  straightforward  tests  of  the  linear  and  log-linear 
models  against  each  other  when  truncation  of  the  disturbance  term  in 
the  linear  model  is  negligible. 

When  the  mean  of  y  is  several  standard  deviations  above  zero  (i.e., 
when  m  is  large  for  the  symmetric  truncation  case),  e  is  approximately 
normally  distributed.  For  sufficiently  large  m,  this  approximation  will 
be  very  close  so  that  (6)  and  (7)  may  be  supplemented  with  NID  errors  as 

2 
H  :   log  y  =3   +  Z  8 .  log  x.   +  z      ,   e   -  NID(0,a  )    (6') 

0  t    o    .   l      it    ot    ot  o 

l 

2 
&-.  y  =  y   +  2  Y.x.^      +  el4.,   e,.  ~  NID(0,a  ).    (7') 

1  J  t    o    .   l  it         It    It  1 

l 

Combining  the  disturbances  from  H  and  H  yields 

o      i 


(l-a)(log  yt-B0-E  $±    log  x.c)  +  a(yt-Y0-Z  Y^)  =  et 


(13) 


-i  n- 


in  which  z      is  normally,  independently  and  identically  distributed  under 

both  H  :   a  =  0  and  H.  :   a  =  1.   Rearrangement  of  (13)  leads  to  the  fol- 
o  1 

lowing  two  artificially  constructed  models 


log  y  =  SQ  +  Z  B.  log  x       +  6  e   +  e  (14) 

i 

7t  "  T0  +  t   YdKlt      +  6l£oc  ♦  «t  (15) 

in  which  9   =  -  a/(l-a)  and  8,  =  -(l-a)/a.   A  test  of  h  :   0   =  0  in 
o  1  oo 

(14)  is  equivalent  to  a  test  of  a  =  0,  and  testing  H  :   9,  =  0  in  (15) 

is  equivalent  to  testing  a  =  1.   However,  as  (14)  and  (15)  stand,  H 

o 

and  EL  are  not  testable  because  e,   and  z        are  not  observed  variables. 
1  It      ot 

The  approach  we  will  adopt  is  to  replace  the  disturbances  from  H.  and 

H  in  (14)  and  (15),  respectively,  with  certain  estimated  residuals, 
o 

As  will  be  seen,  this  approach  has  the  dual  advantage  of  testability 
and  exactness. 

In  order  to  make  the  hypotheses  contained  in  (14)  and  (15)  testable, 

A  •• 

let  us  denote  by  3.  and  y  .  (j  =  0,1,..., k)  the  OLS  estimates  of  8.  and 

AAA 

Y   from  (6')  and  (7'),  respectively,  so  that  log  y  =  S   +  I  8.  log  x 
i  t    o    .   i      it 

-  i 

and  y  =  y   +  Z  Y.x.  .   Now  consider  the  artificial  regressions 
Jt  o    .   l  it  ° 

l 


exp(log  yt)    =  yQ   +  Z   y±x±t   +  \t 

i 

log  ;t  =MU.  log  xit  +  nQC 


(16) 
(17) 


in  which  the  dependent  variables  in  (7')  and  (6?)  have  been  replaced  by 
transformations  of  predicted  values  from  (6')  and  (7').   Denote  the 
residuals  from  (16)  and  (17)  by 


-11- 


n   -  exp(log  yt)  -  yQ  -  Z   Y±xit  (18) 

i 

not  =  l0g  yt  "  8o  "  Z  Si  l0g  Xit  (19) 

and  substitute  (18)  and  (19)  into  (14)  and  (15),  respectively,  in  place 

of  the  unobserved  en   and  e   .   The  tests  of  H  and  Hn  are  the  t-ratios 

It      ot  o      1 

for  the  OLS  estimates  of  9   and  Q.    in 

o  1 


log   y     =8      +  I    S.    log  x.      +  9   n.      +  £ 
&    Jt  o  .        1  °       It  o    It  t 


(20) 


yt=\+l   Vit  *  Vot   +  V  (21) 

l 

Since  n,   is  a  function  of  log  y   through  (16),  n.   is  independent  of 

the  OLS  residuals  from  (20)  under  H  ;  similarly,  n   is  independent 

o  ot 

under  H  of  the  OLS  residuals  from  (21)  because  it  is  a  function  of  y 

through  (17).   Therefore,  the  results  of  Mi Hi ken  and  Graybill  (1970) 

may  be  used  to  show  that  the  relevant  t-ratios  for  0   and  9,  from  (20) 

o      1 

and  (21),  respectively,  both  have  the  exact  t-distribution  with  (T-k-2) 

degrees  of  freedom.   Strictly  speaking,  of  course,  the  test  of  the 

linear  model  in  (21)  has  only  the  approximate  t-distribution  under  H 

because  e    is  only  approximately  normally  distributed  for  large  m. 

The  test  procedures  are  quite  straightforward  to  implement.   For 

example,  in  order  to  test  H  in  (7'),  we  require  only  the  following  two 

simple  steps: 

(i)   Replace  log  y  in  (6*)  with  log  y   (as  in  (17)),  and  obtain 

the  residuals  n    (as  in  (19)); 

(ii)   Test  the  significance  of  n   when  it  is  included  in  H  (as 

ot  i 

in  (21)),  and  reject  (do  not  reject)  H  if  91  is  (is  not) 
significantly  different  from  zero. 
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It  is  worth  noting  that  the  nesting  procedure  used  in  (13)  is  arbi- 
trary.  We  could,  of  course,  have  considered  a  weighting  scheme  of  the 
form 


(l-a)exp(log  y  -  Q     -  E  B.  log  x  )  +  a(y  -  y  -  E  y  x  )  =  e   (22) 

1  1 

so  that  the  test  of  rL  would  be  the  t-ratio  of  9.  in 

y   =  v   +  Z  y.x.   +  61exp(n   )  +  e  (23) 

Jt         'o    .   l  it     1   K   ot      t 

l 

instead  of  in  (21).   Although  the  t-ratios  for  9,  from  (21)  and  (23) 
both  have  the  exact  t(T-k-2)  distribution  under  H  ,  and  hence  are 
asymptotically  equivalent  under  the  null  and  against  local  alternatives, 
the  power  of  H  may  well  differ  between  (21)  and  (23).   Therefore,  it 
would  be  useful  to  compute  the  test  statistics  from  both  equations  for 
purposes  of  comparison.   Notice  that  we  cannot  use  the  weighting  scheme 

(l-a)(log  yt  -  8q  -  U.  log  x^)  +  a  log(yt  -  yq  -  E  Y..xit;)  =  £,.  (24) 

i  i 

instead  of  (13)  and  (22),  and  the  testing  equation 

log  y  =8   +  Z  3.  log  x   +  9olog(nlt)  +  et  (25) 

i 

rather  than  (20),  because  there  is  a  non-zero  probability  that  (y  - 

y   -  Z  y.x.  )  in  (24)  and  n,   in  (25)  may  be  negative  for  some  t. 
o    .   l  l t  It 

l 

A  related  problem  arises  in  the  derivation  of  the  Andrews  test  by 

Godfrey  and  Wickens.   These  tests  of  H  and  Hn  are  based  on  artificial 

o      1 

regressions  similar  to  (20)  and  (21),  in  which  n,   and  n   are  replaced 
by 
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qlt  =  V2[(log  yt)2  -  Z    3. (log  x.^"] 


and 


q    =  [(y  log  y  -  v  +  1)  -  2  Y-(x.  log  x.   -  x.   +  1)1 
Hot     wt   s  Jt        't        .   l   it   a   it    it 

respectively.   The  appropriate  tests  of  H  and  H  are  simply  the  t-ratios 

of  the  estimated  coefficients  of  q,   and  q   ,  respectively,  with  the 

It      ot  J 

test  of  H  (H,)  being  distributed  exactlv  (approximately  exactly)  as 
o   1 

t(T-k-2)  under  the  respective  null  hypotheses.   Thus,  both  the  Andrews 
test  and  the  new  test  derived  in  this  section  have  the  same  size  in  small 
samples  and  are,  of  course,  asymptotically  equivalent  under  the  tested 
model.   However,  their  respective  powers  may  differ  in  finite  samples 
when  the  alternative  is  "true." 

It  is  useful  to  mention  three  aspects  of  the  tests  developed  above. 
First,  it  has  been  assumed  that  truncation  of  the  linear  raoel  is  negli- 
gible, so  that  e1   is  normal.   There  is,  therefore,  a  non-zero  proba- 
bility of  obtaining  negative  values  of  y   (and  hence  of  y  )  for  some  t, 

in  which  case  n   may  not  be  obtainable.   In  such  circumstances,  an 

ot 

exact  test  of  the  linear  model  may  be  unavailable.   Similar  views  were 

expressed  by  Godfrey  and  Wickens  (1981,  p.  493)  in  relation  to  the 

Andrews  test  of  the  linear  model.   Second,  the  predicted  values  exp(log  y  ) 

and  log  y  may  be  used  to  replace  n,   and  n   ,  respectively,  in  (20)  and 
&  ^t  K      It      ot 

(21).   The  exactness  of  the  tests  of  H  and  H  is  unaffected,  although 

o      1 

the  small  sample  power  characteristics  may  differ  when  predictions  under 
the  null  replace  the  calculated  residuals.   Third,  while  the  exactness 
of  the  tests  depends  upon  the  normality  assumption,  their  asymptotic 
validity  does  not. 


5 .   Tests  in  the  Presence  of  Truncation 

When  m  is  not  large  so  that  truncation  of  the  linear  model  may  not 

reasonably  be  ignored,  it  is  convenient  to  assume  that  e,   is  distributed 

It 

as  truncated  normal  in  the  range  (-m  ,QO)  y    where  m  =  y      +  Z   y.x.   (see 

s  t  t  o         .      1    it 

1 

Amemiya,    1973).      The    p.d.f.    of   £,       can    then   be   written   as 


)    =    [F(2^%rle'l^2ljA 


Tr(elt)    =    [Ft(2ir)  '*o    ] 


where 


v         Am   M     ,-1  -^2/<J2Lf 
F     =  J      L(27t)      a   ]      e        *        Ld£. 

—  00 

2 
It  is  easily  seen  that  E(e]    )  =  a   (f  /F  ),  where 

ft  =  [(2.)\]-1.-,^/^. 

Now  consider  the  augmented  version  of  H   (see  Olsen,  1980,  p.  1817) 


HJ:   yt  =Yq  +Z  Vit  +  fiCf^)  +  e*t 


(26) 


where  5  =  a-,  and  e|  =  e   -  6(f  /F  ).   Note  that  the  disturbance  term 
in  (26)  is  heteroscedastic  with  zero  mean.   Therefore,  for  purposes  of 

testing,  h  can  be  replaced  by  H*.   The  quantity  (f  /F  )  is,  however, 

2 

unknown  because  it  depends  on  the  unknown  parameters  (y  »Y.»cO»   These 

parameters  may  be  replaced  by  their  maximum  likelihood  estimates 

~2  " 

(y  >Y.»0-|),  from  which  we  can  obtain  (f  /F  ).   A  suitably  modified  form 
oil  t   t 

of  H,  is  therefore  given  by 

H**:   y  =  y   +  2  Y.x.   +  5(f  /F  )  +  £**.  (27) 

1    Jt  o    .lit      tt     It 
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Equation  (27)  is  equivalent  to  (7 ' )  ,  except  for  the  appearance  of 

the  additional  regressor  (f  /F  )  and  the  fact  that  the  disturbance  terra 

is  now  heteroscedastic  and  serially  correlated  since  e£*  =  e*  + 

<5(f  /F  )  -  5(f  /F  ).   This  will  lead  to  a  modification  of  our  test  pro- 

v  t   t       t   t  r 

cedure.   For  testing  the  log-linear  model,  we  would  replace  equation 
(16)  with 


exp(log  y£)  =  yo  +  Z   Y±x±t    +  <S(ft/Fc)  +  n^ 

i 

and  the  analysis  would  proceed  as  before.   The  t-ratio  for  9   in  the 

o 

equivalent  of  (20)  is  asymptotically  distributed  under  H  as  N(0,1). 
In  order  to  test  the  linear  model,  the  truncated  linear  model  is  first 
estimated  to  obtain  the  predicted  values  y  ,  which  are  then  used  in 
equation  (17)  to  obtain  n    as  in  (19).   Finally,  (21)  is  replaced  by 

y  =  Y  +  Z  y.x.„  +  5(f./F  )  +  9^   +  e.  .  (28) 

J  t  o    .lit      tt     lot    t 

l 

In  this  case  it  should  be  noted  that  the  test  is  not  exact  because  the 
tested  model  is  not  actually  H  ,  but  rather  H**  (i.e.,  H  modified  to 
account  for  truncation).   Since  the  disturbance  in  (28)  is  both  hetero- 
scedastic and  serially  correlated,  for  valid  inferences  the  consistent 
covariance  matrix  estimator  of  White  and  Domowitz  (1984)  should  be  used 
for  testing  the  hypothesis  H** :   9   =  0.   The  appropriate  t-ratio  for 
9   in  (28)  will  be  asymptotically  distributed  under  H  as  N(0,1). 

It  would  be  interesting  to  investigate  how  these  modifications  to 
incorporate  truncation  in  the  linear  model,  as  well  as  the  LM  statistics 
discussed  in  Section  2,  will  affect  inferences.   It  is  worth  observing 
that  in  many  empirical  studies  the  log-linear  model  has  been  found  to 


-I  To- 
be  superior  to  the  linear  model  when  truncation  has  been  (rightly  or 
wrongly)  ignored.   The  results  might  well  be  different,  at  least  for 
models  based  on  survey  data,  if  a  suitably  truncated  linear  model  were 
to  be  tested  against  a  log-linear  model. 

6 .   Summary 

This  paper  addressed  several  issues  associated  with  testing  linear 
and  log-linear  regression  models  against  each  other.   The  analysis  in 
Section  2  focused  attention  on  symmetric  and  unsymmetric  truncation  of 
the  errors  in  the  linear  model,  and  showed  how  the  Lagrange  multiplier 
test  of  the  linear  model  is  affected  when  a  general  form  of  truncation 
is  used.   The  effect  of  the  truncation  depends  on  the  relative  magni- 
tudes of  the  regression  component  and  the  error  variance  of  the  linear 
model.   In  Section  3  it  was  shown  that  the  Lagrange  multiplier  sta- 
tistics used  to  test  the  hypotheses  can  also  be  used  to  discriminate 
between  them.   Some  new  computationally  straightforward  exact  tests 
which  treat  the  linear  and  log-linear  regressions  explicitly  as  non- 
nested models  were  derived  in  Section  4.   These  tests  may  easily  be 
generalized  to  handle  different  data  transformations  of  the  dependent 
variable.   While  the  exactness  of  the  tests  depends  upon  the  normality 
assumption,  their  asymptotic  validity  does  not.   These  tests  were  modi- 
fied in  Section  5  to  incorporate  truncation  in  the  errors  of  the  linear 
model . 
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